Abstract

"Spikes are the neural code": this claim is about 15 years old (Shadlen & Newsome, 1994; Rieke, Warland, Steveninck, 
& Bialek, 1996), preceded by theoretical studies on the underlying mathematical processes (e.g., (Gerstein & Mandelbrot, 
1964)), and followed by many developments regarding biological modelling or computational paradigms, or both (e.g., 
(Thorpe, Delorme, & VanRullen, 2001)). However, the involvement of spikes in neural coding is still an open subject.
Several fundamental aspects of dynamics based on spike timing have very recently been clarified, both at the neuron
level (Touboul & Brette, 2008) and at the network level (Cessac & Vieville, 2008). Nevertheless, a non-negligible set
of received ideas, such as the "incredible power of spikes" or the "mystery of the [spike-based] neural code" (sic),
is still encountered in the literature.

In this article, we wish to demystify some aspects of coding with spike timing, through a simple review of
well-understood technical facts regarding spike coding. The goal is to help clarify to what extent computing
and modelling with spiking neuron networks can be biologically plausible and computationally efficient.

We intentionally restrict ourselves to deterministic dynamics in this review, and we consider that the dynamics of
the network is defined by a non-stochastic mapping. This allows us to stay in a rather simple framework and to propose a
review with concrete numerical values, results and formulas on (i) general time constraints, (ii) links between continuous
signals and spike trains, (iii) parameter adjustments in spiking networks.

When implementing spiking neuron networks, for computational or biological simulation purposes, it is important to
take into account the indisputable facts reviewed here. This precaution could prevent one from implementing mechanisms
that are meaningless with regard to obvious time constraints, or from introducing spikes artificially when continuous
calculations would be sufficient and simpler. It is also pointed out that implementing a spiking neuron network is
ultimately a simple task, unless complex neural codes are considered.

Key Words Spiking neuron network. Neural code. Time constraints. Spike train metrics. 

1 Introduction 

Let us consider, for instance, biological models of cortical maps (Koch & Segev, 1998; Dayan & Abbott, 2001), in a
context where the spiking nature of neuronal activity is made explicit (Gerstner & Kistler, 2002b), either from a biological
point of view or for computer simulation. From the detailed Hodgkin-Huxley model (Hodgkin & Huxley, 1952) (still
considered as the reference but unfortunately intractable when considering neural maps), back to the simplest
integrate-and-fire (IF) model, a large family of continuous-time models has been produced, often compared with respect to
their (i) biological plausibility and their (ii) simulation efficiency.
Theoretically, spiking neurons can perform very powerful computations with precise spike timings. Spiking neurons
are at least as computationally powerful as the sigmoidal neurons traditionally used in artificial neuron networks (Maass,
1997; Maass & Natschlager, 1997). This result has been shown using a spike-response model (see (Maass & Bishop,
2003) for a review) and considering piece-wise linear approximations of the membrane potential profiles. In this context,
analog inputs and outputs are encoded by temporal latencies of spike firings. It has been shown that any feed-forward
(multi-layer) or recurrent analog neuronal network (e.g. Hopfield network) can be simulated arbitrarily closely by an
insignificantly larger network of spiking neurons. The assertion holds even in the presence of noise (Maass, 1997; Maass
& Natschlager, 1997). Such theoretical results strongly motivate the use of spiking neuron networks for modelling and
simulation purposes.

Biological plausibility of neuron network models. 

Biological plausibility at the neuron level is understood as the ability to reproduce what is observed at the cell level, often
considering in-vitro experiments (Koch & Segev, 1998). This point of view is questionable, as shown in recent experiments
in V1 (Fregnac, 2003, 2004) where it appears that single-cell observations differ strongly between in-vitro and in-vivo
conditions. Biological plausibility at the network level is understood as the ability to reproduce what is observed regarding
e.g. the cortical map activity (Carandini et al., 2005). This includes predicting the response not only to specific artificial
stimuli, but also to natural stimuli: this means, for V1, taking into account natural image sequences shifted by eye movements
(Baudot, 2007), after the retinal and LGN processing (see e.g. (Simoncelli & Olshausen, 2001) for a discussion about
information processing in these structures).

As far as this contribution is concerned, we consider a weaker notion of biological plausibility: A simulation is
biologically plausible if it verifies an explicit set of constraints observed in biology. More precisely, we are going to
review and discuss a few time constraints, shared by all dynamics, further called "general time constraints". We develop
their consequences at the simulation level. The time constraints are based on biological temporal limits and appear to be
very valuable quantitative elements, both for estimating the coding capacity of a system and for improving simulations.

Simulation efficiency of integrate and fire models. 

Among all the spiking neuron models, the punctual conductance based generalized integrate and fire (gIF) is an adaptive, 
bi-dimensional, non-linear, integrate-and-fire model with conductance based synaptic interaction (as e.g. in (Destexhe, 
1997; Brette & Gerstner, 2005; Rudolph & Destexhe, 2007)). At the present state of the art, considering gIFs as neuron 
models presents several advantages: 

- They provide an effective description of the neuronal activity, allowing one to reproduce several important neuronal
regimes (E. Izhikevich, 2004), with a good match to biological data, especially in high-conductance states, typical of cortical
in-vivo activity (Destexhe, Rudolph, & Pare, 2003).

- They are nevertheless a simplification of Hodgkin-Huxley models, which is useful both for mathematical
analysis and numerical simulations (Gerstner & Kistler, 2002b; E. Izhikevich, 2003).

In addition, though these models have mainly been considered for studying the dynamics of a single neuron, they are
easy to extend to a network structure, including synaptic plasticity modelling (Markram, Lübke, Frotscher, & Sakmann,
1997; Pfister & Gerstner, 2006).

See, e.g. (Rauch, La Camera, Luscher, Senn, & Fusi, 2003) for further elements in the context of experimental 
frameworks and (Camera, Giugliano, Senn, & Fusi, 2008a, 2008b) for a review. 

However, in all the variants of integrate and fire models, it is assumed that an instantaneous reset of the membrane
potential occurs after each spike firing, except for the Spike Response Model of (Gerstner & Kistler, 2002b). The reset
is a formal simplification and has a general spurious effect: Information theory (e.g. Shannon's theorem, stating that the
sampling period must be less than half the period corresponding to the highest signal frequency) is not applicable with
unbounded frequencies. From the information theory point of view, it is tempting to relate this spurious property to
the erroneous idea that the neuronal network information is not bounded. In biological reality, time synchronization is
indeed not instantaneous (action potential time-course, synaptic delays, refractoriness, ...).

What is the paper about 

In section 2, we emphasize the fact that, in computational or biological contexts, not all time sequences correspond to 
realistic spike trains since they are constrained by the neural dynamics, while general time constraints are also to be taken 
into account. We revisit this apparently obvious point and provide numerical evaluations. One of the constraints we 
propose is far from being obvious and we discuss that point in detail. 

In section 3, we make explicit the maximal amount of information present in a "true" spike train, taking the general
time constraints into account. This point of view contradicts what is usually implicitly assumed about the "incredible
power of computing with spikes" and, although obvious, the limitation we derive is worth recalling.

In section 4, we review a recent work which clarifies the kind of dynamics encountered in deterministic integrate and 
fire neuron networks, demystifying the notion of "chaotic spiking network dynamics" and supplying a rigorous notion of 
what is called the "edge of chaos". 

In section 5, we discuss to what extent defining the "neural code" contained in spike trains is related to the choice of
a metric, in the deterministic case, i.e. when the dynamics of the neuron network is defined by a non-stochastic mapping.
The relation with existing neural codes (rate coding, rank coding, phase coding, ...) is discussed.

As a first major consequence, considering convolution metrics in section 6, we can make explicit, in the linear case,
the links between spike trains and continuous signals, with concrete methods to build such a link.

As a second major consequence, considering alignment metrics in section 7, we can describe methods to explicitly 
program spiking neuron network parameters in order to obtain a given input/output relation in the deterministic case 
again. 

2 General time constraints in spike trains 

The output of a spiking neuron network is a set of events, defined by their occurrence times, up to some precision:

$$\mathcal{F} = \{\cdots t_i^n \cdots\} \quad \text{with} \quad t_i^1 < t_i^2 < \cdots < t_i^n < \cdots, \; \forall i, \forall n$$

where $t_i^n$ is the n-th spike time of the neuron $i$, with related inter-spike intervals $d_i^n = t_i^n - t_i^{n-1}$. See e.g.
(Dayan & Abbott, 2001; Gerstner & Kistler, 2002b; Schrauwen, 2007; Paugam-Moisy & Bohte, 2009) for an introduction to
spiking neuron networks.

In computational or biological contexts, not all sequences of events correspond to spike trains, since spike trains are
constrained by the neural dynamics. The following time constraints must also be taken into account:

[C1] The inter-spike intervals are bounded by a refractory period $r$: $d_i^n > r$.

[C2] The spike times are defined up to some absolute precision $\delta t$.

[C3] There is always a minimal delay $dt$ for a pre-synaptic spike to influence a post-synaptic neuron, thus having a
causal effect on another spike.

[C4] There is a maximal inter-spike interval $D$ such that $\forall i, \forall n$, either $d_i^n < D$ or $t_i^n = +\infty$
(i.e. either a neuron fires within a time delay $< D$ or it remains quiescent forever).
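As a minimal sketch, the two constraints on inter-spike intervals, [C1] and [C4], can be checked on a recorded or simulated spike train. The function name is hypothetical, and the values of r and D in the usage note are illustrative assumptions rather than values taken from the text.

```python
import math


def check_time_constraints(spike_times, r, D):
    """Check [C1] and [C4] on one neuron's sorted spike times.

    spike_times, r and D must share the same time unit (e.g. ms).
    Returns a pair of booleans (c1_holds, c4_holds).
    """
    # Inter-spike intervals d^n = t^n - t^(n-1)
    intervals = [t1 - t0 for t0, t1 in zip(spike_times, spike_times[1:])]
    # [C1]: every interval is larger than the refractory period r
    c1 = all(d > r for d in intervals)
    # [C4]: every interval is below D, or infinite (the neuron stays quiescent)
    c4 = all(d < D or math.isinf(d) for d in intervals)
    return c1, c4
```

For instance, with the assumed values r = 1 and D = 1000 (in ms), `check_time_constraints([0.0, 2.0, 5.0], 1.0, 1000.0)` accepts the train for both constraints, while a train containing two spikes 0.5 ms apart violates [C1].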
For biological neurons, orders of magnitude are typically, in milliseconds:
These numerical evaluations are discussed in the present section.

The [C1] constraint is well-known as a limit for the maximal firing rate. See e.g. (Koch, 1999) for an extended
discussion on absolute / relative refractory periods.

The [C2] constraint may correspond to more than one definition. For instance, probabilistic interpretations often
consider an additive perturbation in the dynamic evolution, to account for the fact that spike times are not precisely
defined. On the other hand, deterministic interpretations may consider precision intervals. Here, we propose a simple
deterministic specification:

Two spike times are different, i.e., not synchronized, if separated by more than $\delta t$.
Two spike times are indistinguishable if they are separated by less than $\delta t$.

Indistinguishable does not mean "equal"; it means that we cannot state whether the spike times are equal or different.
With such a deterministic interpretation, $\delta t$ can be calculated using 1st order approximations. The [C2] constraint
is sometimes "forgotten" in models. In rank coding schemes for instance (Gautrais & Thorpe, 1998) it is claimed that "all"
spike-time permutations are significant, which is not realistic since many of these permutations are indistinguishable,
because of the bounded precision, as discussed in e.g. (Vieville & Crahay, 2004). Similarly, a few concepts related to
"reservoir computing" (see e.g. (Paugam-Moisy, Martinez, & Bengio, 2008) and quoted contributions, for a review) do not
address this issue, although simulations indeed have to take it into account. As a consequence, an unrealistic unbounded
time precision is implicitly assumed.

Spike time precision evaluation 

Considering that the spike time of a real neuron is defined by the time $t_i$ when the membrane potential $V(t_i)$
reaches a maximum, we obtain around $t_i$, assuming differentiability of $V$:

$$V(t) = V(t_i) + \kappa \, (t - t_i)^2 + o(|t - t_i|^2)$$

with $\kappa = d^2V/dt^2(t_i)$ and easily derive, as a rule of thumb for the spike-time precision $\delta t$:

$$\delta t \simeq \sqrt{\frac{\langle \delta V \rangle}{\langle \kappa \rangle}}$$

where $\delta V$ is the voltage precision and the averages $\langle \cdot \rangle$ are to be taken over a set of
measurements. This formula is derived from standard 1st order error analysis.
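The step from the local expansion to the rule of thumb can be made explicit. As a sketch, assume that a shift of the spike time goes unnoticed as long as the corresponding change of the membrane potential stays below the voltage precision $\delta V$; here $\kappa$ denotes the peak curvature $d^2V/dt^2(t_i)$, taken in absolute value since the potential is at a maximum, and $\delta t$ is the largest such shift:

```latex
\begin{aligned}
|V(t_i + \delta t) - V(t_i)| &\simeq |\kappa| \, \delta t^2 = \delta V \\
\Rightarrow \quad \delta t &\simeq \sqrt{\frac{\delta V}{|\kappa|}}
\end{aligned}
```

Averaging $\delta V$ and $|\kappa|$ over a set of measurements then gives an order of magnitude for the precision.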

In order to roughly estimate the spike time precision, we have considered a few dozen spike profiles
in several spike trains (Carandini & Ferster, 2000; Koch, 1999) and graphically estimated the values in a
zoom of the provided figures. We have obtained $\delta t \simeq 0.1$ ms, with a peak curvature order of magnitude
$\langle \kappa \rangle \simeq 100$ mV/ms$^2$ as illustrated in Fig. 1, considering a voltage precision of average
$\langle \delta V \rangle \simeq 10$ nV, i.e. at the order of magnitude of the membrane potential noise (Koch, 1999).
Similar numerical values are obtained reading other electro-physiological data (Carandini & Ferster, 2000).

Furthermore, a similar order of magnitude is obtained in the literature, considering the numerical precision
in inter-neuron synchronization (Crook, Ermentrout, & Bower, 1998), which is found to be about 1 ms, while
(Mainen & Sejnowski, 1995) (e.g. in Fig. 2B) report submillisecond accuracy in vitro, but no finer than
0.1 ms.

Figure 1: Two examples of spike profiles in the cat primary visual cortex. The peak curvature orders of magnitude are
30-100 mV/ms$^2$.

Similarly, [C3] is obvious; its consequence is to avoid spurious effects¹ and to induce simplifications both
at the modelling and simulation levels (Morrison, Mehring, Geisel, Aertsen, & Diesmann, 2005).




Spike time propagation evaluation 

Delays from one spike to another involve the pre-synaptic axonal delay, the synaptic delay and the post-synaptic
dendritic delay. The smallest observed delays (Koch, 1999; Burnod, 1993) seem to be at least
0.5 ms, with values up to 40-50 ms for transmissions between cortical maps. A step further, many local inter-neuronal
connections in the cortex are realized through electrical gap junctions (Galarreta & Hestrin, 2001),
this being predominant between cells of the same sub-population (Amitai et al., 2002). In such a case the
inter-neuron delays are much smaller, but still measurable, since the transmission is mainly due to the spike
potential rise, with a time constant of about 0.1-0.2 ms (see (Lewis & Rinzel, 2003) for a discussion about
the electrical transmission in this case). A reasonable assumption is then to consider that local electrical
connections are delayed by $dt \gtrsim 0.1$ ms.

Gap junction delays are much smaller ($dt \gtrsim 10\,\mu$s) but still non-negligible (Lewis & Rinzel, 2003;
Koch, 1999).

The [C4] constraint is less obvious. The idea is that, in the absence of any input (isolated neuron), the potential
decreases towards a resting potential and the neuron cannot fire anymore. This is true for usual deterministic models,
except for singular internal currents². This behaviour seems realistic for cortical neurons, but likely not for all neurons in
the brain (Paré, Bouhassira, Oakson, & Datta, 1990; McCormick & Bal, 1997).

¹ If a neuron instantaneously fires after receiving a spike, this can generate avalanche effects (another neuron
instantaneously fires and so on) or even temporal paradoxes (another inhibitory neuron instantaneously fires, inhibiting
this one, which was thus not supposed to fire in the first place).

² This is easy to illustrate considering a LIF model, where $g$ and $i$ are constant:

$$C \frac{dV}{dt} + g V = i, \quad V(t_0) = V_0, \quad V(t_1) = \theta \quad \Rightarrow \quad t_1 = t_0 + \frac{C}{g} \log\left(\frac{i - g V_0}{i - g \theta}\right) \quad \text{with } i > g \theta > g V_0.$$

If the internal current verifies $i > g \, \frac{\theta - V_0 e^{-g D / C}}{1 - e^{-g D / C}}$, [C4] is verified.

Since $C/g \simeq 1 \cdots 10$ ms, the factor $e^{-g D / C}$ is negligibly small, so it is sufficient that $i$ exceeds
$g \theta$ by a very small relative amount. It is thus a reasonable numerical assumption to consider that $D$ is bounded.
However, if $i \simeq g \theta$, the firing period becomes unbounded, yielding a spurious event (which can
affect the whole dynamics) at an unbounded instant. This is a singular case.
Spike time upper-bound evaluation



At the simulation level, [C4] is easily violated for deterministic neural models with constant internal current, 
able to integrate during an unbounded period of time, or with maintained sub-threshold oscillations. But 
this singular condition is easy to check and to avoid, and a maximal spontaneous firing period can be derived. 
Synaptic conductance based models (Destexhe, 1997) and spike response models (Gerstner & Kistler, 2002b)
usually omit this constant current and their intrinsic "leak" guarantees that [C4] is not violated. In
contrast, with stochastic models, [C4] might be reconsidered, since there is always a "chance" to fire a spike,
with a decreasing probability as time increases.
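For a leaky integrate-and-fire neuron with constant current, the spontaneous firing period mentioned above has a closed form. The following is a minimal sketch, assuming the membrane equation C dV/dt = -g V + i with constant conductance g and current i, a start value V0 below the threshold theta, and no synaptic input; the function name and the numerical values in the usage note are hypothetical.

```python
import math


def lif_time_to_threshold(C, g, i, V0, theta):
    """Time needed by C dV/dt = -g V + i to go from V0 to theta (V0 < theta).

    Returns math.inf when the asymptotic potential i/g does not exceed
    the threshold, i.e. when the isolated neuron never fires.
    """
    if i <= g * theta:
        return math.inf
    # Closed-form solution of the linear membrane equation
    return (C / g) * math.log((i - g * V0) / (i - g * theta))
```

With C = g = 1, V0 = 0 and theta = 1, a current i = 2 gives a period of log(2) time units, while the period grows without bound as i approaches g theta from above. Checking that this period stays below D is one way to test [C4] for such a model.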

At the biological level³, in vitro, a regularly spiking cortical pyramidal neuron, without synaptic input,
remains silent since its membrane potential is close to the resting potential (Koch, 1999). In vivo, in the
cortex, current observations show that a neuron is always firing (Dayan & Abbott, 2001) (unless it is dead).
This is due to the large amount of neuromodulators, inducing depolarization and a membrane potential close
to the firing threshold. However, this differs from [C4], where isolated neurons are considered. In
contrast, thalamic neurons can fire spontaneously after a long resting period (Paré et al., 1990). Even in vitro,
their internal currents such as IT (low threshold transient Ca2+ current) or IH (hyperpolarization-activated
cation current) can induce spikes (due to oscillatory behaviors) (McCormick & Bal, 1997).

As discussed in detail in (Cessac & Vieville, 2008), whether the constraint [C4] is verified or not completely
changes the nature of the dynamics. When it is not verified, a neuron can remain silent for a very long time, and then
suddenly fire, inducing a complete change in the further state of the system. We distinguish situations with and without
[C4] in the sequel.

Considering [C1-3] and optionally [C4], let us now review the related consequences regarding modelling and simulation.

Simulation of time-constrained networks. The event-based simulation of spiking neuron networks (see e.g. (Brette
et al., 2007) for a review) is strongly simplified by the fact that, thanks to [C2] and [C4], spike times and precisions are
bounded, while thanks to [C3] spiking cannot generate causal paradoxes. Here the specification allows the use of
"histogram based" methods⁴, with a small O(1) complexity (Cessac, Rochel, & Vieville, 2009).

Furthermore, the simulation core is minimal (a 10Kb C++ source code), using an O(D/dt + N) buffer size and about
O(1 + C + e/dt) ~ 10-50 operations/spike ( > 10^ spike/sec on a laptop), for a size N network with C connections on
average.

3 The maximal amount of information 

Considering [C1-2], given a network of $N$ spiking neurons observed during a finite period $[0, T]$, the number of possible
spikes is obviously limited by the refractory period $r$. Furthermore, the information contained in all spike times is strictly
bounded, since two spike occurrences in a $\delta t$ window are not distinguishable, and $\delta t < r$.
A rather simple reasoning yields a rough upper bound for the amount of information:

$$N \, \frac{T}{r} \, \log_2\left(\frac{T}{\delta t}\right) \text{ bits during } T \text{ seconds}$$
Taking the biological values into account, a straightforward numerical derivation leads to about 1 Kbit/neuron, for

³ We are especially thankful to Dr. Thierry Bal, for a scientific discussion on this subject.
⁴ Source code available at http://enas.gforge.inria.fr.
Information upper bound evaluation



Let us consider a given neuron (the index number is omitted) and its first spike time $t^1$. The next spike
firing of the neuron (i) either occurs no later than $t^1 + \delta t$, thus at a time not distinguishable from $t^1$ by an
observer, (ii) or occurs at least $\delta t$ later. In order to be meaningful, spikes must thus occur in distinct temporal
boxes of width $\delta t$, the precise location of the boxes being fixed by the first time occurrence, as schematized in
Fig. 2. Since there is a refractory period $r > \delta t$, the second and next spikes will never be mixed with their
predecessors but are going to be subject to the same limitation.

[Figure 2 labels: spike-time precision; 1st spike in the raster-plot; non-distinguishable spike-time; subsequent spike-time.]

Figure 2: Evaluating the information in a set of spike times. See text for details.

As a consequence, no more than one spike every $r$ milliseconds can be introduced in this temporal
histogram of $\delta t$ box width, as illustrated in Fig. 2. In a $[0, T]$ time range, there are $T/\delta t$ choices for the
first spike, fewer than $T/\delta t - 1$ for the second, etc. This means that, for the maximal number $T/r$ of spikes, there
are fewer than $(T/\delta t)^{T/r}$ choices.

Assuming, as a maximal case, that each neuron is independent, we obtain the proposed bound. 
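Taking the base-2 logarithm of the number of choices, $(T/\delta t)^{T/r}$ per neuron, and summing over N independent neurons gives the bound directly. The following sketch evaluates it numerically; the function name is hypothetical, and the observation window T = 100 ms and refractory period r = 1 ms used in the usage note are assumed illustrative values, with only $\delta t$ = 0.1 ms taken from the precision estimate above.

```python
import math


def information_upper_bound(N, T, r, delta_t):
    """Rough upper bound, in bits, on the information carried by the spike
    trains of N independent neurons observed during a period T.

    T, r and delta_t must be expressed in the same time unit.
    """
    # At most T/r spikes per neuron, each with T/delta_t possible time boxes
    return N * (T / r) * math.log2(T / delta_t)
```

With N = 1, T = 100, r = 1 and delta_t = 0.1 (all in ms), the bound evaluates to 100 log2(1000), i.e. slightly less than 1000 bits for a single neuron.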

This is a rough upper bound that does not take into account constraints imposed by the dynamics at the network level.
These constraints further reduce the available information. In fact, the dynamics of a given network strongly constrains
the possible spike trains, and the real entropy may be lower, or even much lower, than this bound.

In the particular case of fast-brain mechanisms, where only "the first spikes matter" (Thorpe & Fabre-Thorpe, 2001),
this amount of information is not related to the permutations between neuron spikes, i.e. of order $O(\log(N!)) =
N \log(N)$, but simply proportional to $N$, consistent with what is found in (Vieville & Crahay, 2004).

The latter bound is consistent with several results presented in (Rieke et al., 1996), where the authors consider firing
rates and use entropy as an information measure. For instance, considering a timing precision of 0.1-1 ms as set here,
the authors obtain an information rate bounded around 500 bits/s for a neural receptor. This number has the same order
of magnitude as the one obtained from the previous general bound. The network dynamics itself introduces more specific
constraints, thus yielding an information rate lower than predicted by the previous bound. However, we see here that the
effective information rate is not an order of magnitude lower: In practice, the dynamics appears rich enough to maintain
a high information rate.

This information bound is not bad news but good news. The result means that different pieces of information are necessarily
represented by distinguishable spiking patterns. In other words, there is a well-defined margin between two different
information representations. Neuronal coding with large margins is discussed in (Vieville & Crahay, 2004), and may
explain the surprisingly impressive performance of fast brain categorization. This amounts to introducing an
incompressible margin between two pieces of information, which guarantees a robust coding.
4 Dynamics of time-constrained networks



A step further, taking [C1-3] into account allows us to "discretize" the spike train sequences. A raster is formally defined
as follows: To each neuron of index $i$ a binary variable $\omega_i(k) \in \{0, 1\}$ is associated such that the neuron fires
during the $k$-th sampling period if and only if $\omega_i(k) = 1$ and is silent otherwise. The sampling period is taken
smaller than $r$, $\delta t$ and $dt$: smaller than $r$ in order to have either 0 or 1 spike during a sampling period; smaller
than $\delta t$ so that the sampling does not impair the spike-time precision; smaller than $dt$ since, in a discrete-time
system, the information is propagated from one sampling period to another through recurrence relations.
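The definition of the raster can be sketched as a conversion from spike times to binary variables. This is only an illustration: the function name is hypothetical, and the caller is assumed to have chosen a sampling period smaller than r, so that each box receives at most one spike per neuron.

```python
import math


def spikes_to_raster(spike_trains, T, period):
    """Build a binary raster from spike times observed during [0, T).

    raster[i][k] == 1 if and only if neuron i fires during the k-th
    sampling period [k * period, (k + 1) * period).
    """
    n_bins = int(round(T / period))
    raster = [[0] * n_bins for _ in spike_trains]
    for i, times in enumerate(spike_trains):
        for t in times:
            k = math.floor(t / period)  # index of the sampling period containing t
            if 0 <= k < n_bins:
                raster[i][k] = 1
    return raster
```

For example, `spikes_to_raster([[0.2, 1.7, 3.1], [0.6]], 4.0, 0.5)` yields two rows of eight binary values, with ones at indices 0, 3 and 6 for the first neuron and at index 1 for the second.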

In simple models such as basic leaky integrate and fire (LIF) or integrate and fire neuron models with conductance 
synapses and constant external current (gIF), a full characterization of the network dynamics can be derived from such a 
discretization. For these two neuron models, it has been shown that (Cessac, 2008; Cessac & Vieville, 2008): 

• [P1] The raster is generically⁵ periodic, but, depending on parameters such as constant external current or synaptic
weights, periods can be larger than any accessible computational time;

• [P2] There is a one-to-one correspondence between orbits⁶ and rasters (i.e. a raster provides a symbolic coding for
the network dynamics).

Note that [P1] and [P2] are properties of usual integrate and fire network models with constant parameters (weights,
delays, etc.).

The fact [P1] makes it possible to clearly understand to what extent spike trains can code information: Periodic orbits give
the code. When the parameters vary, the orbits change accordingly but are still periodic (with possibly very large periods).

The fact [P2] means that, in the LIF and gIF cases, the raster is a "symbolic coding" in the sense that no information
is lost by considering the spike times instead of the membrane potential variations.

Both facts also allow one to deeply understand the network dynamics: Fig. 3 sketches out some aspects, illustrating
the global behavior of the system and the fact that attractors are generically stable periodic orbits. More precisely, the
dynamics is piece-wise continuous, i.e. continuous except when a spike is fired. The dynamics is locally contracting.
Furthermore, after each neuron has fired once, the dynamics is no longer dependent on the initial conditions. Nevertheless,
when the membrane potential is close to the threshold, a small perturbation may induce drastic changes in the dynamics,
while it is otherwise damped. This behaviour corresponds to a notion of "edge of chaos" which is precisely defined within
this framework (Cessac, 2008; Cessac & Vieville, 2008), although this definition differs from the usual notion of chaos in
differentiable systems (the terminology "stable chaos" has been proposed by (Politi & Torcini, 2009)).

Remarks 

Time is discretized, but without any constraint about the "sampling period". The [P1] and [P2] results hold at any
finite precision. However, to what extent the period of the periodic orbits is independent of the sampling period,
provided the sampling period is small enough, or more generally how the periodic orbits depend on the sampling
period, is still an open issue.

In order to understand [P1], it might be important to discuss how "obvious" it is. Time is discretized. If the membrane
potential had also been discretized, the model would have reduced to a finite state system. In that case, only fixed points
and periodic orbits could occur and the result would have been obvious. As a consequence, [P1] reads: Even if the neuron
state is described by continuous values, orbits are still generically periodic.
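The "obvious" finite-state case can be illustrated by a short sketch: any deterministic map on a finite set of states must revisit a state after finitely many steps, and from then on the orbit is periodic. The function below is a generic illustration with hypothetical names, not a simulation of a neuron network.

```python
def find_cycle(step, state):
    """Iterate a deterministic map on a finite state space until a state repeats.

    Returns (transient_length, period). A period of 1 is a fixed point.
    States must be hashable.
    """
    first_visit = {}  # state -> iteration index of its first visit
    n = 0
    while state not in first_visit:
        first_visit[state] = n
        state = step(state)
        n += 1
    # The repeated state closes the cycle
    return first_visit[state], n - first_visit[state]
```

For the toy map s -> (s * s + 1) mod 10, the orbit starting from 0 is 0, 1, 2, 5, 6, 7, 0, ..., so `find_cycle(lambda s: (s * s + 1) % 10, 0)` reports no transient and a period of 6.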

⁵ Considering a basic leaky integrate and fire neuron network, the result is true except for a negligible set of parameters.
Considering an integrate and fire neuron model with conductance synapses, the result is true unless the trajectory
accumulates on the threshold from below.

⁶ Here we consider orbits, i.e. infinite trajectories, and thus consider this deterministic system, with constant input, in
its asymptotic stage.
[Figure 3 panel label: parameters variation.]

Figure 3: Describing the basins of attraction of the dynamic landscape, for deterministic time-constrained networks. [A] The phase
space (in other words the space of the network states) is partitioned into bounded domains $B_i$ and for each initial condition in
$B_i$ the initial trajectory is attracted, not towards a fixed point (as in Hopfield networks with asynchronous dynamics), but
towards a periodic orbit $A_i$. [B] If the parameters (external input, weights) change, the landscape is modified and several
phenomena can occur: change in the attractors' shapes or in the number of attractors, as for $A_3$ in this example; a point
belonging to $A_4$ in Fig. 3A can, after modification of the parameters, converge either to attractor $A_2$ or $A_3$.

In a conductance based model, with the additional constraint that conductances depend on previous spikes within a
finite horizon, it appears that [P1] still holds, although this is intuitively less obvious (Cessac & Vieville, 2008).

To what extent such a "canonical situation" still holds for more complex models is an open question. We can easily
conjecture that [P1] is a model limitation for all integrate and fire models, provided they are defined with an instantaneous
reset to a constant value. The question is still open for SRM models.

The [P2] statement can be explained as follows. Changing the initial value of the membrane potential, one may expect
some variability in the dynamics. But due to the reset, close-by distinct trajectories can be collapsed onto the same
trajectory after a finite time. As a result, the membrane potential evolution then depends only on the previous spike times,
instead of the previous membrane potential values (Cessac, 2008).

Since periods exhibited by integrate and fire models can be arbitrarily large, depending on parameters such as synaptic
weights, it is likely that rasters produced by these models can approach rasters produced by more realistic models such
as Hodgkin-Huxley, for a finite horizon. However, this suggestion is a conjecture only. This property is reminiscent of
the shadowing lemma of dynamical systems theory (Katok & Hasselblatt, 1998), stating that chaotic orbits produced by a
uniformly hyperbolic system can be approached arbitrarily closely by periodic orbits.

5 Neural coding and spike train metrics 

In a biological as well as a computational context, the analysis of experimental or simulation data often requires a
comparison between two or several spike trains. Either the spike trains concern a given neuron and result from several
repetitions of the same experiment, or the spike trains have been generated by different neurons during a given time range,
in a unique experiment. In both cases, the idea is to look for invariants, or differences, in the underlying neural code. In
the present section and the next two, we study the relation between neural coding and different spike train metrics.

As an illustrative example, let us consider the temporal order coding scheme (Gautrais & Thorpe, 1998; Thorpe &
Fabre-Thorpe, 2001) (i.e. rank coding): Only the order of the events matters, not their specific time values. Two spike
trains $\mathcal{F}_1$, $\mathcal{F}_2$ with the same event ordering correspond to the same code. This assertion defines an
"equivalence relation" which structures the set of all the spike trains into a partition: all spike trains in the same
equivalence class correspond to the same "code".

Similar definitions can be given for other coding methods. For instance, rate coding means that all spike trains with 
the same frequency are in the same equivalence class, irrespective of their phase. 

Let us now reconsider the question of neural coding in the light of the time constraints discussed in previous sections. The fact that spike time precision is not unbounded leads to many indistinguishable orderings. This does not change the rank coding concept, but the partition is now coarser: trains with two spikes occurring at indistinguishable times are in the same equivalence class.
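The equivalence-class view can be sketched in a few lines of Python. This is only an illustration under stated assumptions: each neuron contributes its first spike time, and two times falling in the same bin of width `dt` are treated as indistinguishable. The names `rank_code` and `same_code`, and the binning rule itself, are hypothetical choices and are not taken from the reviewed works.

```python
def rank_code(first_spike_times, dt):
    """Return the rank code of a set of first-spike times.

    first_spike_times: dict mapping a neuron label to its first spike time.
    dt: time precision; spikes falling in the same bin of width dt are
        treated as indistinguishable (an illustrative assumption).
    The code is a tuple of groups of neurons, ordered by firing bin.
    """
    bins = {}
    for neuron, t in first_spike_times.items():
        bins.setdefault(int(t // dt), set()).add(neuron)
    return tuple(frozenset(bins[b]) for b in sorted(bins))


def same_code(train_a, train_b, dt):
    """Discrete comparison: True when both trains share the same rank code."""
    return rank_code(train_a, dt) == rank_code(train_b, dt)


a = {"n1": 1.2, "n2": 3.7, "n3": 8.1}
b = {"n1": 0.4, "n2": 5.9, "n3": 6.3}   # same ordering as a, different times
c = {"n1": 3.9, "n2": 3.1, "n3": 8.1}   # n2 fires before n1
d = {"n1": 3.1, "n2": 3.9, "n3": 8.1}   # n1 fires before n2
```

With a fine precision (`dt = 0.1`) the trains `c` and `d` belong to different classes, whereas with a coarse precision (`dt = 1.0`) the two early spikes become indistinguishable and both trains share one class. A fixed binning also separates spikes that sit just on either side of a bin edge, which is one limitation of a purely discrete comparison.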

Let us now introduce the notion of spike train metric. The basic idea consists of defining a "distance" $d(\cdot)$, such that $d(\mathcal{F}_1, \mathcal{F}_2) = 0$ if $\mathcal{F}_1$ and $\mathcal{F}_2$ correspond to the same code, and 1 otherwise.

A step further, how can we capture the fact that, e.g. for rank coding, two spike times with a difference of "about" $\delta t$ are "almost" indistinguishable? The natural idea is to use a "quantitative" distance instead of a discrete distance (i.e. with binary 0/1 values): two spike trains correspond exactly to the same neural code if the distance is zero, and the distance increases with the difference between the trains.

This is the idea we wanted to highlight here. This proposal is not a mathematical "axiomatic", but a simple modelling choice. The principle is far from being new but, rather surprisingly, it has not been made explicit at this level of simplicity. In order to see the interest of the idea, let us briefly review the main classes of spike train metrics.

As reviewed in detail in (Schrauwen, 2007; Victor, 2005), spike train metrics can be categorized in three classes:
- 0 - "Bin" metrics, based on grouping spikes into bins (e.g. rate coding metrics): not discussed here.
- 1 - Convolution metrics, including the raster-plot metric: discussed in Section 6.
- 2 - Spike time metrics, such as alignment distances (Victor & Purpura, 1996): discussed in Section 7.

6 Using convolution metrics to link spike trains and continuous signals

Linear representation. A large class of metrics is defined through the choice of a convolution kernel $K_i$ applied to a spike train function written $\rho_i(t) = \sum_{t_i^n \in \mathcal{F}_i} \delta(t - t_i^n)$, where $\delta(\cdot)$ is the Dirac distribution. For a given spike train $\mathcal{F}_i$, the equation is:

$$s_i(t) = \sum_{t_i^n \in \mathcal{F}_i} K_i(t - t_i^n) = K_i * \rho_i(t) \in [0, 1].$$

The signal $s_i$ is easily normalized between 0 (no spike) and, say, 1 (burst mode at the maximal frequency).

The distance between two spike trains is then defined by applying some norm to the continuous signal $s = (\cdots, s_i, \cdots)$, at the network level. The "code" here corresponds to the linear representation metric: the codes are similar if the related continuous signals are similar. It allows us to link spike trains with a continuous signal $s$.

The so-called "kernel methods" based on the Mercer theorem (Schrauwen, 2007) are directly linked to the linear representation, since they are defined as scalar products, writing:

$$\langle s, s' \rangle = \sum_i \sum_{n,m} \hat{K}_i(t_i^n - t_i'^m),$$

with direct correspondences between the usual kernels $\hat{K}_i$ and linear convolutions, e.g.:





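[Table: correspondences between the usual triangular, exponential and Gaussian kernels and the related convolution kernels. Only the triangular profile, of the form $\max(1 - \cdot\,|d|, 0)$, is legible in this copy.]

where $H$ is the Heaviside function. Distances based on inter-spike intervals are also included, as developed in e.g. (Kreuz, Haas, Morelli, Abarbanel, & Politi, 2007).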
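As a worked check of one such correspondence, consider a causal exponential convolution kernel. The normalization $\sqrt{2\lambda}$, the decay rate $\lambda$ and the symbol $\hat{K}$ are chosen here for illustration and are not read from the table. With $K(t) = \sqrt{2\lambda}\, H(t)\, e^{-\lambda t}$, the scalar product of two convolved signals reduces to a sum over spike pairs:

```latex
\int s(t)\, s'(t)\, dt
  = \sum_{n,m} \int K(t - t^n)\, K(t - t'^m)\, dt
  = \sum_{n,m} \hat{K}(t^n - t'^m),
\qquad
\hat{K}(d) = \int K(u)\, K(u + |d|)\, du
  = 2\lambda \int_0^{+\infty} e^{-\lambda u}\, e^{-\lambda (u + |d|)}\, du
  = e^{-\lambda |d|}.
```

Under this assumption, a causal exponential convolution corresponds to a symmetric exponential kernel on spike time differences, which depends only on $|d|$ although the convolution itself is causal.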
Figure 4: A few examples of spike train convolution: [A] The spike train itself, [B] A causal local frequency measure estimation (writing $\chi$ the indicator function), [C] A non-causal spike density, uniformly equal to 1 in burst mode, [D] A normalized causal exponential profile, parameterized by a decay time $\tau$. Evoked post-synaptic potential profiles are nothing but such causal convolutions (using e.g. double-exponential kernels to capture the synaptic time-constant (weak delay) and potential decay). Similarly, spike train representations using Fourier or wavelet transforms are intrinsically related to such convolutions.



Non-static kernels of the form $K_i^n(t - t_i^n)$ (i.e. depending on the spike time $t_i^n$) can also be used (clock-dependent coding, raster, first-spike coding, ...), while non-linear Volterra series are useful for representing "higher order" phenomena (see e.g. (Rieke et al., 1996)).

These linear representations not only provide tools to compare different spike trains, but also allow one to better understand the link between continuous signals and spike times. For instance (Dayan & Abbott, 2001; Maass, 1997), writing $s(t) = \sum_i \lambda_i\, s_i(t)$ is a means to define some network readout to link spiking networks to "analog" sensory-motor tasks. Let us illustrate this aspect by the following results.
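A minimal Python sketch of this linear representation follows. It assumes an unnormalized causal exponential kernel with decay time `tau`, a weighted-sum readout, and a distance obtained by sampling the squared difference of two traces on a regular grid. The function names, the kernel choice and the grid step `dt` are assumptions made for the example only.

```python
import math


def convolve_spikes(spike_times, t, tau):
    """Causal exponential trace: sum of exp(-(t - tn) / tau) over spikes tn <= t."""
    return sum(math.exp(-(t - tn) / tau) for tn in spike_times if tn <= t)


def readout(spike_trains, weights, t, tau):
    """Linear readout: weighted sum of the unit traces at time t."""
    return sum(w * convolve_spikes(train, t, tau)
               for train, w in zip(spike_trains, weights))


def trace_distance(train_a, train_b, tau, t_end, dt=0.1):
    """Approximate L2 norm of the difference of two traces on [0, t_end)."""
    steps = int(round(t_end / dt))
    total = 0.0
    for k in range(steps):
        t = k * dt
        diff = convolve_spikes(train_a, t, tau) - convolve_spikes(train_b, t, tau)
        total += diff * diff * dt
    return math.sqrt(total)


a, b, c = [1.0, 4.0], [1.2, 4.0], [2.5, 4.0]
near = trace_distance(a, b, tau=1.0, t_end=10.0)   # first spike shifted by 0.2
far = trace_distance(a, c, tau=1.0, t_end=10.0)    # first spike shifted by 1.5
```

In this sketch the distance is zero for identical trains and grows with the size of the spike shift, which is the "quantitative" behaviour expected from a convolution metric.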

Kernel identification. Given a causal signal $s_i$ generated by a spike train $\mathcal{F}_i$ at the unit level, the problem of identifying the related kernel is formally solved by the following paradigm:

$$\min_{K_i} \int_{t>0} |\hat{s}_i(t) - s_i(t)|^2 \, dt = \int_{\lambda} |K_i(\lambda)\, \rho_i(\lambda) - s_i(\lambda)|^2 \, d\lambda,$$

where $\hat{s}_i = K_i * \rho_i$ is the signal predicted by the kernel, using the Laplace transform Parseval theorem (here, $\lambda$ is the Laplace domain variable), thus:

$$K_i(\lambda) = \left[ s_i(\lambda)\, \rho_i(\lambda)^T \right] \left[ \rho_i(\lambda)\, \rho_i(\lambda)^T \right]^{-1},$$

i.e. the ratio of the cross-correlation between signal and spike train to the spike train auto-correlation. Non-causal estimation would consider the Fourier transform. This setting corresponds to several identification methods (Dayan & Abbott, 2001; Schrauwen, 2007).

The paradigm is to be used, for instance, for identifying the average synaptic response profile from the observation of the input spike train and of the evoked synaptic potential output. Given the observation of a spike train function $\rho_i$ and the related response $s_i$, the previous formula allows one to estimate the related kernel.
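The cross-correlation over auto-correlation ratio can be illustrated with a small discrete-time sketch. It uses the Fourier variant mentioned above rather than the Laplace transform, assumes circular (periodic) convolution on a short window, and adds a small regularization term `eps` to the denominator. The helper names, the window length and the toy kernel are assumptions made for the example.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive discrete Fourier transform (O(n^2)), adequate for short signals."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def circular_convolve(kernel, rho):
    """Periodic convolution s[t] = sum_u kernel[(t - u) mod n] * rho[u]."""
    n = len(rho)
    return [sum(kernel[(t - u) % n] * rho[u] for u in range(n)) for t in range(n)]


def identify_kernel(rho, s, eps=1e-12):
    """Kernel estimate: cross-spectrum of (s, rho) over the power spectrum of rho."""
    R, S = dft(rho), dft(s)
    K = [S[k] * R[k].conjugate() / (abs(R[k]) ** 2 + eps) for k in range(len(R))]
    return [z.real for z in dft(K, inverse=True)]


n = 61
true_kernel = [math.exp(-t / 5.0) for t in range(n)]   # toy causal profile
rho = [0.0] * n
rho[4] = 1.0
rho[20] = 1.0                                          # two spikes
s = circular_convolve(true_kernel, rho)
estimate = identify_kernel(rho, s)
max_error = max(abs(e - k) for e, k in zip(estimate, true_kernel))
```

In this noise-free toy case the kernel is recovered almost exactly. With noisy observations the regularization term would have to be tuned, since frequencies at which the spike train spectrum is small amplify the noise.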
Spike deconvolution. A step further, if we know the convolution kernel $K_i$, it is straightforward to formally write $\rho_i = L_i * s_i$ with $L_i = F^{-1}\left[ \frac{1}{F[K_i]} \right]$, writing $F$ the Fourier transform. The filter $L_i$ is well defined and allows one to reconstruct the spike train from the continuous signal, as illustrated in Fig. 5.



Figure 5: A small experiment of spike deconvolution, showing the signal before (left) and after (right) deconvolution. On the left, the signal is the convolution of a spike train using an $\alpha(t) = \frac{t}{\tau}\, e^{-t/\tau}$ profile, with addition of noise and of a spurious sinusoid which has been added as an outlier to the signal. Spikes are not "visible" in the sense that they do not correspond to maxima of the signal, because the spike responses are mixed. On the right, the deconvolution is shown: the outlier is amplified, but spikes clearly emerge from the signal.

The good news is that the inverse convolution filters $L_i$ are not singular, so that the deconvolution is well defined and in explicit form. However, this requires the use of derivative filters, known to be sensitive to noise. Unpublished numerical investigations have shown that as soon as the error on the kernel profiles is higher than 10-20%, several spikes are lost in the deconvolution.
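The role of derivative-like inverse filters can be seen on a discrete-time toy case, which is not the experiment of Fig. 5. The sketch assumes a causal exponential kernel implemented by the recursion `s[k] = gamma * s[k-1] + rho[k]`, whose exact inverse is the two-tap difference filter `rho[k] = s[k] - gamma * s[k-1]`. The function names and the value of `gamma` are assumptions made for the example.

```python
def exponential_trace(rho, gamma):
    """Discrete causal exponential convolution: s[k] = gamma * s[k-1] + rho[k]."""
    s, prev = [], 0.0
    for r in rho:
        prev = gamma * prev + r
        s.append(prev)
    return s


def deconvolve(s, gamma):
    """Inverse filter of the exponential trace: rho[k] = s[k] - gamma * s[k-1]."""
    return [s[k] - (gamma * s[k - 1] if k > 0 else 0.0) for k in range(len(s))]


rho = [0, 1, 0, 0, 1, 1, 0, 0, 0, 1]
signal = exponential_trace(rho, gamma=0.8)

exact = deconvolve(signal, gamma=0.8)    # kernel known exactly
biased = deconvolve(signal, gamma=0.6)   # kernel decay misestimated

exact_error = max(abs(x - r) for x, r in zip(exact, rho))
biased_error = max(abs(x - r) for x, r in zip(biased, rho))
```

When the kernel is known exactly, the spike train is recovered up to rounding errors. When the decay is misestimated, a residual proportional to the previous signal value leaks into the reconstruction, which gives a feeling for why errors on the kernel profile degrade spike recovery.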



Signal reconstruction. In order to further understand the power of representation of spike trains, (Lazar, 2005) has generalized the well-known Shannon theorem as follows: a signal in the frequency range $[-\Omega, \Omega]$ is entirely defined by irregular sampling values $s_i^n$ at spike times $t_i^n$:

$$s(t) = \sum_n s_i^n \, K(t - t_i^n) \quad \text{with} \quad K(t) = \frac{\sin(\Omega\, t)}{\pi\, t},$$

provided that $\max_n d_i^n < \frac{\pi}{\Omega}$.

This supplies an explicit signal "decoding", since given any signal $s$ it provides an explicit formula to represent this signal by a convolution kernel $K$ and a spike train.



Raster metrics. A step further, it is easy to see that representing the spike times by a raster corresponds to a non-static convolution kernel. A given raster can be represented by a real number in $[0, 1[$, the binary representation of its decimal part being the spike train itself. Using this representation, a useful related metric is of the form, for $\theta \in ]0, 1[$:

$$d_\theta(\omega, \omega') = \theta^T, \quad T = \arg\max_t \left( \omega^t = \omega'^t \right),$$

writing $\omega^t$ for the raster $\omega$ restricted to times up to $t$, thus capturing the fact that two rasters are equal up to a certain rank. Such metrics can be applied to analyze the dynamics of spiking networks and they are typically used in the context of symbolic coding in dynamical systems theory (Cessac, 2008; Cessac & Vieville, 2008).
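A minimal Python sketch of such a raster metric follows, assuming that a raster is stored as a list of per-time-step tuples of binary spike states and that $T$ counts the initial time steps on which both rasters coincide. Other indexing conventions differ by a factor $\theta$. The function name and the data layout are hypothetical.

```python
def raster_distance(omega, omega_prime, theta=0.5):
    """Return theta ** T, where T is the number of initial time steps on which
    the two rasters coincide; return 0.0 when the rasters are identical."""
    if not 0.0 < theta < 1.0:
        raise ValueError("theta must lie strictly between 0 and 1")
    horizon = min(len(omega), len(omega_prime))
    for t in range(horizon):
        if omega[t] != omega_prime[t]:
            return theta ** t
    # No mismatch on the common horizon: identical, or one raster is longer.
    return 0.0 if len(omega) == len(omega_prime) else theta ** horizon


# Two neurons, four time steps; each tuple holds the spike states at one step.
r1 = [(1, 0), (0, 0), (1, 1), (0, 1)]
r2 = [(1, 0), (0, 0), (1, 0), (0, 1)]   # differs from r1 at step 2
r3 = [(0, 0), (0, 0), (1, 1), (0, 1)]   # differs from r1 at step 0
```

The later two rasters start to differ, the smaller their distance, so the metric weights early discrepancies most heavily.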
7 Using alignment metrics to program spiking neuron networks



The original alignment metric. The second family of metrics we want to review considers spike times directly (Victor 
& Purpura, 1996; Victor, 2005). 

The distance between two finite spike trains $\mathcal{F}$, $\mathcal{F}'$ is defined in terms of the minimum cost of transforming one spike train into the other. Two kinds of operations are defined:

• spike insertion or spike deletion, the cost of each operation being set to 1;

• spike shift, the cost to shift from $t_i^n \in \mathcal{F}$ to $t_i'^m \in \mathcal{F}'$ being set to $|t_i^n - t_i'^m| / \tau$ for a time constant $\tau$.

For small $\tau$, the distance approaches the number of non-coincident spikes, since instead of shifting spikes it is cheaper to insert/delete non-coincident spikes, the distance being always bounded by the number of spikes in both trains.

For high $\tau$, the distance basically equals the difference in spike number (rate distance), while for two spike trains with the same number of spikes, there is always a time-constant $\tau$ large enough for the distance to be equal to $\sum_n |t_i^n - t_i'^n| / \tau$.

Here, two spike times are comparable if they occur within an interval of $2\tau$; otherwise it is cheaper to delete one and insert the other.

Although computing such a distance seems subject to a combinatorial complexity, it appears that quadratic algorithms are available (i.e. with a complexity equal to the product of the numbers of spikes). This is due to the fact that, in a minimal path, each spike can be either deleted or shifted once to coincide with a spike in the other spike train. Also, a spike can be inserted only at a time that matches the occurrence of a spike in the other spike train. It allows us to calculate iteratively the minimal distance, considering the distance $d_{n,n'}(\mathcal{F}, \mathcal{F}')$ between a spike train composed of the first $n$ spikes of $\mathcal{F}$ and the first $n'$ spikes of $\mathcal{F}'$.
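The iterative computation described above can be sketched as a standard dynamic program, in the spirit of the alignment distance of (Victor & Purpura, 1996). The function name and the list-based table are choices made for this example. Costs follow the text: 1 for an insertion or a deletion, and the time difference divided by `tau` for a shift.

```python
def alignment_distance(train_a, train_b, tau):
    """Minimal cost of transforming train_a into train_b.

    Insertion or deletion of a spike costs 1; shifting a spike by dt costs
    abs(dt) / tau. Runs in time proportional to len(train_a) * len(train_b).
    """
    a, b = sorted(train_a), sorted(train_b)
    n, m = len(a), len(b)
    # d[i][j]: distance between the first i spikes of a and the first j of b.
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = float(i)            # delete the first i spikes of a
    for j in range(1, m + 1):
        d[0][j] = float(j)            # insert the first j spikes of b
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d[i][j] = min(
                d[i - 1][j] + 1.0,                                # deletion
                d[i][j - 1] + 1.0,                                # insertion
                d[i - 1][j - 1] + abs(a[i - 1] - b[j - 1]) / tau  # shift
            )
    return d[n][m]


shifted = alignment_distance([1.0], [1.5], tau=1.0)    # shifting is cheaper
replaced = alignment_distance([1.0], [1.5], tau=0.1)   # delete + insert is cheaper
```

The two calls illustrate the $2\tau$ rule: a shift is preferred as long as its cost stays below 2, the price of one deletion plus one insertion.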



Figure 6: An example of minimal alignment from the upper to the lower spike train, using from top to bottom an insertion, a rightward 
shift, a leftward shift and a deletion respectively. 

When considering spike trains with more than one unit, one approach consists of summing the distances obtained for each unit-to-unit alignment. Another point of view is to consider that a spike can "jump", with some cost, from one unit in $\mathcal{F}$ to another unit in $\mathcal{F}'$. The related algorithmic complexity is then no longer quadratic, but grows as a power of the number of units (Aronov, 2003).

This family of metrics includes alignments not only on spike times, but also on inter-spike intervals, or metrics which are sensitive to patterns of spikes, etc. They have been fruitfully applied to a variety of neural systems, in order to characterize neuronal variability and coding (Victor, 2005). For instance, in a set of neurons that act as coincidence detectors with integration time (or temporal resolution) $\tau$, spike trains will have similar postsynaptic effects if they are similar with respect to this metric.

Generalization of the alignment metric. Let us remark, here, that the previous metric can be generalized as follows:

- [causality] At a given time, the cost of the alignment of previous spikes decreases with the obsolescence of the spike, say, with an exponential profile parameterized by a time-constant $\tau'$. At the infinity limit for $\tau'$, the original alignment metric is retrieved.

- [non-linearity] The cost of a shift is not necessarily a linear function of $|t_i^n - t_i'^m| / \tau$, as in the original metric, but any suitable non-linear function $\phi(|t_i^n - t_i'^m| / \tau)$.

For instance, we may choose a small quadratic profile when lower than the time precision (accounting for additive noise, but implementing the fact that spike time differences are negligible), and then a linear profile.
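This leads to an iterative definition of the previous distance $d_{n,n'}$:

$$d_{n,n'} = \min \left\{ \begin{array}{l} d_{n-1,n'} + 1, \\ e^{-\frac{\max(t_i^n,\, t_i'^{n'}) - \min(t_i^{n-1},\, t_i'^{n'-1})}{\tau'}} \; d_{n-1,n'-1} + \phi\left( |t_i^n - t_i'^{n'}| / \tau \right), \\ d_{n,n'-1} + 1, \end{array} \right.$$

where $\phi$ is the shift cost function, with, e.g., $\phi(d) = \min(d, (d\, \tau / \delta t)^2)$, again implementable in quadratic time. It corresponds to the original alignment metric iff $\phi$ is the identity function and $\tau' = +\infty$.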

This modified version of the metric illustrates how versatile this class of distances is for representing the differences between spike trains.



Weight training from spike times. As a formal application, let us consider a Spike Response Model neuron (Gerstner & Kistler, 2002a) of the form:

$$V_i(t) = \nu(t - t_i^{n-1}) + \sum_{j} \sum_{m} w_{ij}\, \alpha(t - t_j^m) \quad \text{for } t_i^{n-1} < t \le t_i^n,$$

the spike time being defined by $V_i(t_i^n) = \theta$, where $\theta$ is the spiking threshold.

The previous metrics on spike times allow us to optimize the neural weights in order to tune spike times, deriving, e.g., rules of the form:

[equation not legible in the source]

Such mechanisms of optimization are also applicable to time-constants, delays or thresholds. It appears that this method cannot be easily used in practice, since the equation is numerically unstable (Schrauwen, 2007). However, using spike train metrics leads to the formalization of such adaptation rules, in order to "compute with spikes".
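One plausible way to obtain an adaptation rule of this kind, offered as an illustration rather than as the formula intended in the text, is implicit differentiation of the threshold condition. The sketch assumes a potential of the form $V_i(t) = \nu(t - t_i^{n-1}) + \sum_{j,m} w_{ij}\, \alpha(t - t_j^m)$, a spike time defined by $V_i(t_i^n) = \theta$, previous spike times held fixed, a desired spike time $\bar{t}_i^n$ and a learning rate $\epsilon$. The last two symbols are introduced for the example only.

```latex
V_i(t_i^n) = \theta
\;\Rightarrow\;
\frac{\partial V_i}{\partial t}(t_i^n)\, \frac{\partial t_i^n}{\partial w_{ij}}
  + \sum_m \alpha(t_i^n - t_j^m) = 0
\;\Rightarrow\;
\frac{\partial t_i^n}{\partial w_{ij}}
  = - \frac{\sum_m \alpha(t_i^n - t_j^m)}{\frac{\partial V_i}{\partial t}(t_i^n)},
\qquad
\Delta w_{ij} = - \epsilon\, (t_i^n - \bar{t}_i^n)\, \frac{\partial t_i^n}{\partial w_{ij}}.
```

Under these assumptions the denominator is the slope of the membrane potential at the spike time. When the potential reaches the threshold almost tangentially this slope is close to zero and the update becomes very large, which is consistent with the numerical instability reported for this family of rules.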

Let us further develop this point now. 



8 Implementing spiking neuron networks 

Spiking neuron network models 

In a biological context, spiking neuron networks are useful for modelling different areas identified in the brain by neurophysiological experiments and to validate, or invalidate, hypotheses made on their possible functional interactions. For instance, in the ANR MAPS project, interactions between Superior Colliculus (SC), Excitatory Burst Neurons (EBN), central Mesencephalic Reticular Formation (cMRF), OmniPause Neurons (OPN) and MotoNeurons (MN) are modelled by large spiking neuron networks in order to explain the control mechanisms of ocular saccades (work in progress).

In a computational context, spiking neuron networks are mainly implemented through specific network architectures, such as Echo State Networks (Jaeger, 2003) and Liquid State Machines (Maass, Natschlager, & Markram, 2002), that are called "reservoir computing" (see (Verstraeten, Schrauwen, DHaene, & Stroobandt, 2007) for unification of reservoir computing methods at the experimental level). In this framework, the reservoir is a network model of neurons (these can be linear or sigmoid neurons, but are more usually spiking neurons), with a random topology and a sparse connectivity. The reservoir is a recurrent network, with weights that can be either fixed or driven by an unsupervised learning mechanism. In the case of spiking neurons (e.g. in the model of (Paugam-Moisy et al., 2008)), the learning mechanism is a form of synaptic plasticity, usually STDP (Spike-Time-Dependent Plasticity), a biologically inspired temporal Hebbian unsupervised learning rule. The output layer of the network (the so-called "readout neurons") is driven by a supervised learning rule, generated from any type of classifier or regressor, ranging from a least mean squares rule to sophisticated discriminant or regression algorithms. The ease of training and a guaranteed optimality guide the choice of the method. It appears that simple methods yield good results (Verstraeten et al., 2007). This distinction between a readout layer and an internal reservoir is indeed induced by the fact that only the output of the neuron network activity is constrained, whereas the internal state is not controlled.

Calculability of neural networks 

Let us now consider the calculability of neuron network models. It is known that recurrent neuron networks with frequency 
rates are universal approximators (Schafer & Zimmermann, 2006), as multilayer feed-forward networks are (Hornik, 
Stinchcombe, & White, 1989). This means that neuron networks are able to simulate dynamical systems, not only to 
approximate measurable functions on a compact domain, as originally stated (see, e.g., (Schafer & Zimmermann, 2006) for a detailed introduction to these notions). Spiking neuron networks have also been proved to be universal approximators (Maass, 2001).

Learning the parameters of a spiking neuron network

In a biological context, learning is mainly related to synaptic plasticity (Gerstner & Kistler, 2002a; Cooper, Intrator, Blais, & Shouval, 2004) and STDP (see e.g., (Toyoizumi, Pfister, Aihara, & Gerstner, 2007) for a recent formalization), as far as spiking neuron networks are concerned. This unsupervised learning mechanism is known to reduce the variability of neuron responses (Bohte & Mozer, 2007) and is related to the maximization of information transmission (Toyoizumi, Pfister, Aihara, & Gerstner, 2005) and mutual information (Chechik, 2003). It also has other interesting computational properties, such as tuning neurons to react as soon as possible to the earliest spikes, or segregating the network response in two classes depending on the input to be discriminated, and more general structuring such as the emergence of orientation selectivity (Guyonneau, VanRullen, & Thorpe, 2004).

In the present study, the point of view is quite different: we consider supervised learning and, since "each spike may matter" (Guyonneau et al., 2004; Delorme, Perrinet, & Thorpe, 2001), we want to reproduce the spiking output exactly, not only statistically.

The motivation to explore this track is twofold. On one hand, we want to better understand what can be learned at a theoretical level by spiking neuron networks, tuning weights and delays. The key point is the non-learnability of spiking neurons (Sima & Sgall, 2005), since it is proved that this problem is NP-complete when considering the estimation of both weights and delays. Here we show that we can "elude" this caveat and propose an alternative efficient estimation, inspired by biological models.

We also have to notice that the same restriction applies not only to simulation but, as far as this model is biologically plausible, also holds at the biological level. It is thus worth asking whether, in biological neuron networks, delays are really estimated during learning processes, or whether a weaker form of weight adaptation, as developed now, is at work.

On the other hand, the computational use of spiking neuron networks in the framework of reservoir computing or beyond (Schrauwen, 2007), at application levels, requires efficient tuning methods not only on "average", but also in the deterministic case. This is the reason why we must consider how to exactly generate a given spike train.
Weak estimation of network parameters

As pointed out previously, the non-learnability of spiking neurons is known (Sima & Sgall, 2005), i.e. the previous 
estimation is proved to be NP-complete. This means that in order to "learn" the proper parameters we have to "try all 
possible combinations of delays". This is intuitively due to the fact that each delay has no "smooth" effect on the dynamics 
but may change the whole dynamics in an unpredictable way. 

We propose to elude this NP-complete problem by considering another estimation problem. Here we do not estimate one delay (for each synapse) but consider connection weights at several delays and then estimate a balancing of their relative contributions. This means that we consider a weak delay estimation problem.

The alternative approach is to estimate delayed weights, i.e. a quantitative weight value $W_{ijd}$ at each delay $d \in \{1, D\}$, using e.g. a model of the form:

$$V_i[k] = \gamma_i\, V_i[k-1]\, (1 - Z_i[k-1]) + \sum_{j=1}^{n} \sum_{d=1}^{D} W_{ijd}\, Z_j[k-d] + I_{ik}.$$

Obviously, the case where there is a weight $W_{ij}$ with a corresponding delay $d_{ij} \in \{0, D\}$ is a particular case of considering several delayed weights, since we can write:

$$W_{ijd} = W_{ij}\, \delta(d - d_{ij}),$$

$\delta(\cdot)$ being the Kronecker symbol in this case. In other words, with our weaker model, we are still able to estimate a neuron network with adjustable synaptic delays.
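The delayed-weight model can be simulated in a few lines of Python. This sketch makes assumptions that go beyond the text: a common leak `gamma` for all units, a firing rule `Z_i[k] = 1` when `V_i[k] >= theta`, delays ranging from 1 to `D`, and zero initial conditions. The function names and the nested-list layout of the weights are hypothetical.

```python
def simulate(weights, inputs, gamma=0.9, theta=1.0):
    """Discrete-time network with delayed weights.

    weights[i][j][d - 1] holds W_ijd for delays d = 1..D;
    inputs[k][i] holds the external current I_ik.
    Returns the raster Z as a list of per-step lists of 0/1 spike states.
    """
    n = len(weights)
    V = [0.0] * n
    Z = []
    for k, current in enumerate(inputs):
        new_V = []
        for i in range(n):
            # The leak term is cancelled when unit i fired at the previous step.
            reset = 0.0 if (k > 0 and Z[k - 1][i]) else 1.0
            v = gamma * V[i] * reset + current[i]
            for j in range(n):
                for d, w in enumerate(weights[i][j], start=1):
                    if k - d >= 0:
                        v += w * Z[k - d][j]
            new_V.append(v)
        V = new_V
        Z.append([1 if v >= theta else 0 for v in V])   # assumed firing rule
    return Z


def single_delay_weights(w, delays, D):
    """Embed one weight per synapse as W_ijd = w_ij if d == d_ij, else 0."""
    n = len(w)
    return [[[w[i][j] if d == delays[i][j] else 0.0 for d in range(1, D + 1)]
             for j in range(n)] for i in range(n)]


# Unit 0 is driven at step 0 and excites unit 1 through a synapse of delay 3.
w = [[0.0, 0.0], [1.0, 0.0]]
delays = [[1, 1], [3, 1]]
W = single_delay_weights(w, delays, D=3)
inputs = [[1.0, 0.0]] + [[0.0, 0.0]] * 5
raster = simulate(W, inputs)
```

In this toy run unit 0 fires at step 0 and unit 1 fires three steps later, showing that a single weight with an adjustable delay is recovered as a particular choice of delayed weights.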

We thus do not restrain the neuron network model by changing the problem, but enlarge it. In fact, the present 
estimation provides a smooth approximation of the previous NP-complete problem. 

It has been made explicit in (Rostro-Gonzalez, Cessac, Vasquez, & Vieville, 2009) that the parameter estimation of such a neuron network, in order to generate a given spike train, is a Linear (L) problem if the membrane potentials are observed and a Linear Programming (LP) problem if only spike times are observed, with a gIF model. Such L or LP adjustment mechanisms are distributed and have the same structure as a "Hebbian" rule. A step further, this paradigm is easily generalizable to the design of input-output spike train transformations. This means that a practical method is available to "program" a spiking network, i.e. to find a set of parameters allowing us to exactly reproduce the network output, given an input.

Polychronization and limitations of metrics. A spiking network can polychronize, i.e., exhibit reproducible time-locked but not synchronous firing patterns within 1 millisecond precision. Polychronization can be viewed as a generalization of the notions of synchronization and synfire chains. Due to the interplay between the delays and a form of synaptic plasticity (which can be implemented by way of STDP), the spiking neurons spontaneously self-organize into groups and generate patterns of stereotypical polychronous activity.

In (Izhikevich, 2006), it has been shown that the number of co-existing polychronous groups far exceeds the number of neurons in the network, resulting in an unprecedented memory capacity of the system. The author speculates on the significance of polychrony to the theory of neuronal group selection and cognitive neural computations.

In (Paugam-Moisy et al., 2008), the network processing and the resulting performance are explained by the concept of polychronization. The model emphasizes that polychronization can be used as a tool for exploiting the computational power of synaptic delays and for monitoring the topology and activity of a spiking neuron network (Martinez & Paugam-Moisy, 2008).

Such complex aspects of the neural code cannot be taken into account by any of the available metrics. New metrics, taking long-term interactions into account, have to be developed, and this is a challenging issue.
9 Conclusion



This article has reviewed a set of indisputable facts that could help to better understand to what extent computing and modelling with spiking neuron networks can be biologically plausible and computationally efficient. The links between spike trains and neural coding have been highlighted, with the help of several metrics and under a set of time constraints as hypotheses.

Although probabilistic measures of spike patterns such as correlations (Gerstner & Kistler, 2002a) or entropy-based pseudo-distances (e.g. mutual information) provide a view of spike train variability which is enriched by the conceptual framework of information theory, it may be difficult to estimate them in practice, since such measures are robust only if a large number of samples is available. In contrast, distances allow one to characterize aspects of spike coding with efficient methods and without this curse of the sampling size.

This review highlights some of these methods and proposes to consider that "choosing a coding" means "defining a metric". This point of view provides a synthetic insight into several methods applied to spiking neuron networks. To the best of our knowledge, only polychronization mechanisms are not easily represented with such a tool, and it is an interesting issue to study the link between these non-local temporal interactions in neuron networks and the underlying neural code.

Neither "incredible power of spikes" nor "mystery of the [spike based] neural code" here, but some pragmatic and practical facts to better understand to what extent computing and modelling using spiking neuron networks can be useful, and how to implement such networks in a pertinent way.